fix: correct MultiTurnSample user_input validation logic #2426

harshil-sanghvi · 2025-11-15T03:40:38Z

Fixed validation bug where generator expression was not being evaluated.

Changed from checking generator object to using all() to properly validate
all messages are instances of HumanMessage, AIMessage, or ToolMessage.

Added tests to verify validation works correctly.

Problem Description

Problem: The MultiTurnSample.validate_user_input() method had a critical validation bug where the generator expression was not being properly evaluated. The code was checking if not (isinstance(m, ...) for m in messages): which creates a generator object that is always truthy, causing the validation to never trigger.

Impact: This meant that invalid message types could potentially pass validation if they somehow bypassed Pydantic's type checking, though in practice Pydantic's Union validation catches most cases before this validator runs. However, the validator logic itself was fundamentally broken and would not work correctly if called.

How to replicate: The bug can be seen in the code at src/ragas/dataset_schema.py:131-133 where the generator expression without all() would never properly validate the messages.

Changes Made

Fixed validation logic in src/ragas/dataset_schema.py: Changed if not (isinstance(m, ...) for m in messages): to if not all(isinstance(m, ...) for m in messages): to properly evaluate all message type checks
Added comprehensive tests in tests/unit/test_dataset_schema.py:
- test_multiturn_sample_validate_user_input_invalid_type(): Verifies that invalid message types are properly rejected
- test_multiturn_sample_validate_user_input_valid_types(): Verifies that valid message types are properly accepted

References

File changed: src/ragas/dataset_schema.py (line 131-133)
Tests added: tests/unit/test_dataset_schema.py (lines 201-226)
Related code: The validator is part of the MultiTurnSample class which is used throughout the codebase for multi-turn conversation evaluation

Fixed validation bug where generator expression was not being evaluated. Changed from checking generator object to using all() to properly validate all messages are instances of HumanMessage, AIMessage, or ToolMessage. Added tests to verify validation works correctly.

anistark

Thanks for the PR @harshil-sanghvi
Looks good overall.

Can you check why CI is failing.

run make run-ci locally and you can fix it.

harshil-sanghvi · 2025-11-17T18:22:56Z

@anistark make run-ci passes locally
======================== 579 passed, 13 skipped, 2 warnings in 27.94s =========================
All CI checks passed!

anistark · 2025-11-17T21:08:53Z

@anistark make run-ci passes locally ======================== 579 passed, 13 skipped, 2 warnings in 27.94s ========================= All CI checks passed!

The ci is still failing. So, it's missing something. Check the failed job. You might get some ideas.

anistark · 2025-11-20T11:54:22Z

@harshil-sanghvi Might be unrelated issue, please rebase with main. The ci should have been resolved.

dosubot bot added the size:M This PR changes 30-99 lines, ignoring generated files. label Nov 15, 2025

anistark reviewed Nov 17, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: correct MultiTurnSample user_input validation logic #2426

fix: correct MultiTurnSample user_input validation logic #2426

harshil-sanghvi commented Nov 15, 2025

Uh oh!

anistark left a comment

Uh oh!

harshil-sanghvi commented Nov 17, 2025

Uh oh!

anistark commented Nov 17, 2025

Uh oh!

anistark commented Nov 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

fix: correct MultiTurnSample user_input validation logic #2426

Are you sure you want to change the base?

fix: correct MultiTurnSample user_input validation logic #2426

Conversation

harshil-sanghvi commented Nov 15, 2025

Problem Description

Changes Made

References

Uh oh!

anistark left a comment

Choose a reason for hiding this comment

Uh oh!

harshil-sanghvi commented Nov 17, 2025

Uh oh!

anistark commented Nov 17, 2025

Uh oh!

anistark commented Nov 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants